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■ Abstract 
Kautz and de Bruijn graphs have a high degree of connectivity which makes them 

ideal candidates for massively parallel computer network topologies. In order to realize 
a practical computer architecture based on these graphs, it is useful to have a means 
of constructing a large-scale system from smaller, simpler modules. In this paper we 
consider the mathematical problem of uniformly tiling a de Bruijn or Kautz graph. 
°k^l | This can be viewed as a generalization of the graph bisection problem. We focus on 

the problem of graph tilings by a set of identical subgraphs. Tiles should contain a 
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maximal number of internal edges so as to minimize the number of edges connecting 
distinct tiles. We find necessary and sufficient conditions for the construction of tilings. 
We derive a simple lower bound on the number of edges which must leave each tile, and 
construct a class of tilings whose number of edges leaving each tile agrees asymptotically 
in form with the lower bound to within a constant factor. These tilings make possible 
the construction of large-scale computing systems based on de Bruijn and Kautz graph 
topologies. 
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1 Introduction 



The family of graphs known as de Bruijn graphs [T], Kautz graphs [2], and generalized de 
Bruijn and Kautz graphs [HI HI El E], have closely related mathematical structure. These 
graphs exhibit a high degree of connectivity, which makes them natural candidates for mas- 
sively parallel computer network topologies [7]. In particular, for graphs of degree K and 
diameter N, de Bruijn and Kautz graphs achieve the largest possible asymptotic number of 
vertices (~ K N ) [8]. 

The efficient connectivity of de Bruijn and Kautz graphs makes it difficult to partition 
these graphs into collections of subgraphs in such a way as to minimize the number of edges 
connecting the subgraphs to one another. Precisely such a partition is desirable, however, 
in order to physically realize a computer network topology based on a graph of this type. 
The problem of constructing a computer based on a de Bruijn or Kautz graph from a set 
of isomorphic subgraphs (circuit boards) connected together by a minimal number of edges 
motivates the problem we consider in this paper of tiling de Bruijn and Kautz graphs with 
isomorphic subgraph tiles having a maximal number of internal edges. 

In this paper, we define the tiling problem for de Bruijn and Kautz graphs. This problem 
can be thought of as a generalization of the graph bisection problem applied to these graphs, 
where instead of splitting the graph into two parts we wish to split the graph into a larger 
number of equal-size components. We derive a lower bound on the number of edges which 
must connect the tiles of a tiling. Based on the criterion that a tile decomposition should 
be scalable to construct graphs of arbitrary size, we give necessary and sufficient conditions 
for the construction of a uniform tiling. We construct a family of scalable tilings which 
asymptotically realize the lower bound on connecting edges to within a constant factor. We 
also provide examples of optimal tilings for small tiles. Finally, we show that not only de 
Bruijn and Kautz graphs but also their generalizations can be tiled in a similar way using 
the basic mathematical structure underlying de Bruijn graphs. 

The structure of this paper is as follows. In section 2, we fix notation and state the 
fundamental problem addressed in the paper. In section 3, we derive a simple upper bound on 
the number of edges which can be incorporated into a tile of given size, equivalent to a lower 
bound on the number of connecting edges. In section 4, we prove necessary and sufficient 
conditions for the construction of a tiling of a de Bruijn or Kautz graph, and describe a 
general approach for the construction of tilings. Section 5 describes the generalization of 
the results of section 4 to generalized de Bruijn and Kautz graphs. Section 6 contains 
some examples of tilings constructed using the method of section 4. In section 7 we give 
an explicit construction of a family of tiles which asymptotically realize the lower bound of 
section 3 to within a constant factor, as the tile size becomes large. Section 8 contains a brief 
discussion of the application of the tiling approach developed in this paper to supercomputer 
architectures, including a mathematical argument for the optimality of degree 3 Kautz/de 
Bruijn graphs in certain contexts. Section 9 contains some brief concluding remarks. 

A related formulation of this problem was given in [9] , along with some explicit decompo- 
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Figure 1: The de Bruijn graph B\ (a) and Kautz graph /C| (b) of degree 2 and diameter 2. 

sitions of degree 2 de Bruijn graphs into isomorphic subgraph tiles similar to those described 
in this paper. We focus in this paper on the directed graph (digraph) form of de Bruijn and 
Kautz graphs. Similar considerations could be used for tiling the undirected forms of the 
graphs. 



2 Definitions and problem statement 

We begin by defining some notation. 

First we define a (directed) de Bruijn graph. The de Bruijn graph of degree K and 
diameter iV is a digraph with K vertices. The vertices can be labeled by strings of N 
integers base K 

c\c 2 ---c N e z%, (l) 

so Cj G {0, 1, . . . , K — 1}. The de Bruijn graph B^ has edges 

(cic 2 • • -c N ,c 2 c 3 ■ ■ -c N c N+ i) (2) 

for all values of Cj, % 6 {1, . . . N + 1}. There is thus a natural one-to-one map between edges 
of Bk and vertices of B^ +1 . This leads naturally to an inductive construction of B^ +1 as a 
family of iterated line digraphs. 

A (directed) Kautz graph /C^ +1 of degree K and diameter N + 1 can be defined similarly 
as a digraph with (K + 1)K vertices labeled by strings 

SqSi ■ ■ ■ Sn E ^K+l ' Si ^ Si + l ' (3) 

where the integers Si are taken base K+l, but subject to the condition that adjacent integers 
must be distinct. The edges of /C^ +1 are given in a similar fashion to (J2J) by 

{(sqSx - ■ ■ s N ,s 1 s 2 - ■ ■ s N s N+1 )\ Si G Z K+1 Si ^ s i+1 } . (4) 

Some simple examples of de Bruijn and Kautz graphs for small values of the degree K 
and diameter are depicted in Figures [Q13] 
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Figure 2: The de Bruijn graph B\ (a) and Kautz graph (b) of degree 2 and diameter 3. 

Kautz graphs and de Bruijn graphs both have uniform in-degree and out-degree K. Kautz 
graphs and de Bruijn graphs are closely related. Indeed, if we take a Kautz graph /C^ +1 
with vertices sq ■ ■ • sn, we can map the vertices to the vertices of the de Bruijn graph B^ 
through 



This map is a (many to one) graph homomorphism, since if (s • • • $n, Si • • • $n+i) is an edge 
of JC^ +l it follows immediately that the associated (ci • • • cjv, c 2 • • • c/v+i) is an edge of B^. We 
can thus label every vertex in the Kautz graph by s^v, where so 6 I*k+i and v = cic 2 • • • cjy 
is a vertex in a de Bruijn graph B^. The edges of the Kautz graph in this notation are 



where [sq + c± + 1] is taken modulo if + 1. We will use this relationship between Kautz and 
de Bruijn graphs in the following section. 

Notation: if a directed graph has an edge (u, v) we say that v is a child of u and u is a 
parent of i>. We denote the set of children of u by C(u) and the set of parents of v by P(v). 

We are interested in finding tilings of de Bruijn and Kautz graphs with certain properties, 
where a tiling is defined as follows: 

Definition: A tiling of a graph G is a one-to-one (on vertices) graph embedding t : I x Vr — > 
Vg taking the product of an index set I = {1,2, ... , \ Vq\/\ Vt\} and the vertices Vr of a smaller 
graph T (the tile) into the vertices of the graph G such that if (x, y) is an edge of T then 
(t(i, x),t{i, y)) is an edge of G, Vi. 

We wish to find tilings of the de Bruijn and Kautz graphs B^ and /C^ +1 such that the 
number of edges of the tile T is maximized. This problem is motivated from the problem of 
designing a computer with a de Bruijn or Kautz topology, where we wish to construct the 
machine using a set of identical boards (the tiles), minimizing the number of wires connecting 
the boards by maximizing the number of on-board connections corresponding to edges of T. 
Problem 1 For fixed K, N , and tile size S = \ Vt\ such that S\K N , find a tiling of B^ which 



r : s • • • Sn — » C\ ■ • • Cjv, Q = Sj — Sj_i — l(mod K + 1). 



(5) 



(s cic 2 ■ • • c N , [s + Ci + l]c 2 c 3 ■ ■ • C N+ i) 



(6) 
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(a) (b) 
Figure 3: The de Bruijn graph B\ (a) and Kautz graph /C§ (b) of degree 3 and diameter 2. 

maximizes \Et\, the number of edges in T. 

This problem has previously been formulated in the case K = 2, and some example 
tilings described in [9]. 

Problem 2 For fixed K, N, and tile size S\ (K+ 1)K N , find a tiling of /C^ +1 which maximizes 
\Et\- 

Definition: a tiling is scalable if the associated tile T can be used to tile a de Bruijn or 
Kautz graph of any size K N or (K + 1)K N > \Vr\ (for fixed K). 

This property is useful practically because machines of different sizes can be built with 
the same modular component. The condition that the tiling be scalable also enables us to 
make more general statements about the set of allowed tilings. 

Problem 3: For a fixed degree K and tile size S, find a tile T with S = \Vt\ and maximum 
\Et\ which leads to a scalable tiling for either de Bruijn or Kautz graphs. 

When K is prime, any tile must have size K M for some M. For non-prime K, it may be 
possible to find tiles of other sizes giving scalable tilings. We will focus in this paper on tiles 
with sizes which are powers of K. Such tiles can be constructed for all K. 

Definition: We define the maximum number of edges E? which can be realized in any tile 
which has K M vertices and which gives a scalable tiling for de Bruijn and Kautz graphs of 
degree K to be Ek,m- 

Definition: We define the minimum possible number of "broken" edges in any tile with 
K M vertices to be B KM = K M+1 - E K>M . 

In this paper we construct a class of tilings which turn out to have the following useful 
property 

Definition: a tiling has the parallel routing property if for each vertex x G Vr, the children 
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of t(i, x) are mapped under t 1 to vertices in / x V T with the same set of Vr values (for each 
i). In other words, defining the projection II of a vertex in G to the tile T through 



U(t(i,x)) = x Vi (7) 
the parallel routing property states that 

IL(C(t(i,x))) = IL(C(t(i',x))) (8) 

for any i, il in the index set /. 

This property is useful practically because it simplifies the wiring of a machine based on 
a tiling with this property. 



3 Upper bound on internal tile edges 

In this section we derive an upper bound on the number of edges in the tiles of a tiling, 
equivalent to a lower bound on the number of edges which must leave each tile. We assume 
here that tiles are of size K M \ a similar bound can be found for tiles with sizes not of this 
form for composite K by the same analysis. The lower bound found here takes a simple 
asymptotic form as M — > oo for fixed K. We focus on de Bruijn tilings, but analysis of 
Kautz tilings gives the same form for the bound. 

Theorem 1: The number of edges achievable in a tile which leads to scalable tilings has an 
upper bound 

E k ,m < K M+1 - max^l (K M+1 - K 2M+1 - N ) (9) 

Proof: 

We find this upper bound by considering a set of paths which leave the tile; we bound 
the number of these paths which cross any given edge, which gives us a lower bound on the 
number of edges which must leave the tile. This basic method, described in [TT], is used in 
[TO] to construct a lower bound on the edge bisection width of de Bruijn and Kautz graphs. 

Consider a tiling of the de Bruijn graph by tiles of size K M . For a given tile T, 
consider the K M (K N — K M ) paths of length N which go from a node in T to a node in the 
de Bruijn graph outside tile T. Each of these must cross at least one edge which begins at 
a node in T and ends at a node outside T. We refer to such at edge as an edge "leaving" 
the tile T. Each edge leaving the tile is traversed by at most NK N ~ l of the paths under 
consideration (since the edge fixes iV + 1 of the 2N q's defining the path, but may appear 
at any of TV places in the path). Thus, the number of edges leaving the tile (K N+1 — Et) is 
at least 

K +1 -E T > [ AT j = - (K M+1 - K 2M+1 - N ) (10) 
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for any choice of paths, and for any tile T. Thus we have shown 

B K , M > ± {K M+1 - K™^~ N ) (11) 
for any N, which proves the theorem. 

Corollary For fixed K and large M the lower bound on the number of edges leaving the 
tile goes asymptotically as 

B K>M > ± {K M+1 ) (1 - OQogM/M)) . (12) 

Proof: 

By taking N = M + [log^ M\ , we have 

B K , M > i {K M+l - K 2M+1 - N ) ~ ± (K M+1 ) (1 - ©(log M/M)) . (13) 

In Section [7] we explicitly construct a class of tilings which realize this asymptotic bound to 
within a factor of 2. 

The asymptotic lower bound we have found here for the number of broken edges is similar 
in form to that found in [10] for bisection of de Bruijn and Kautz graphs. Those authors 
found that the edge bisection width b e satisfies the asymptotic inequalities 

K N+l 9K N+1 i— 

— (1 - 0(l/K™)) < b e (K, N) < — ^(1 + 0{N/K^)) . (14) 

Note that the asymptotic form of the lower bounds agrees for K — 2, iV = M + 1, where the 
tiling and bisection problems are equivalent. 



4 Structure of tiles 

In this section we prove a necessary condition which a tile of size K M must satisfy to lead 
to a scalable tiling. We then prove a related but slightly stronger sufficient condition for 
scalable tilings, by giving a general construction of a class of tilings based on any tile which 
satisfies the sufficient condition. 
We begin with a few definitions 

Definition: A stratification of a directed graph G is a map a : Vq — > Z from the vertices of 
G into the integers such that for every edge (u, v) in G, we have 

a(u) = a{v) + 1 . (15) 

We refer to u{u) as the level of the vertex u in the stratified graph G. By convention, we 
take the smallest level in a stratification to be by shifting all levels equally. We refer to a 
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graph which admits a stratification as stratifiable. 

Definition: A loop is a directed graph which is topologically equivalent to a circle. 

Note that the edges in a loop need not have compatible orientations. It is straightforward 
to show that a loop is stratifiable if and only if it contains an equal number of edges of each 
orientation. For example, the loop defined by the graph containing 3 vertices u, v, w and edges 
{(u, v), (v,w), (u,w)} is unstratifiable since a stratification would give a{u) = a{v) + 1 = 
o~(w) + 2 from the first two edges and cr(u) = a(w) + 1 from the third edge. 

Lemma 1: A graph G is stratifiable if and only if it contains no unstratifiable loops as 
subgraphs. 

Proof: If G contains an unstratifiable loop L then G is clearly not itself stratifiable since 
any stratification of G would provide a stratification of L. If G contains no unstratifiable 
loops, then we can explicitly construct a stratification of G by choosing a single reference 
vertex in each connected component of G and assigning the level of each other vertex w in 
G by implementing the rule ( TT5l) on each of the edges needed to reach w on any path from 
a reference vertex, allowing the path to traverse edges in either direction. In the absence of 
unstratifiable loops, this assignment is path independent and provides a stratification of G. 

We can now prove the following 

Theorem 2: A necessary condition for a tile T of size K M to give scalable tilings of B^ 
and /C^ +1 for all iV > M is that T must be stratifiable. 

Proof: We begin by assuming that T is unstratifiable and therefore contains an unstratifiable 
loop. We will show that this assumption leads to the conclusion that T cannot be used to 
construct scalable tilings, thus proving the theorem by contradiction. 

Assuming that a given tile T contains an unstratifiable loop L of length n, we write L in 
terms of the sequence of vertices in T it traverses. 

L = [ux,u 2 , ■ ■ ■ , u n , u n+ x = ui] (16) 

u i+ i e C(ui),i e 1+ 
u i+ i g P{v,i),i e L 

where l+,l- denote the sets of indices of vertices associated with edges (ui,u i+ i) which are 
traversed in a forward/backward direction in the loop L, so that L PI /+ = {},/- U 1+ = 
{1,2, ... ,n}. Since L is unstratifiable, we must have 

\l + \^\l-\. (17) 

Now, imagine that the tile T gives a scalable tiling of de Bruijn graphs for N > M 
but nonetheless contains the unstratifiable loop L given in ( !T6|) . In a tiling of the graph 
Br, there will be K N ~ M nodes v which project to each node u € Vr through u = II(t>). 
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In particular, there will be K N ~ M nodes = t(i,Ui) with U(v^) = u\. Choose any such 
node V\ = c±- • • C/v which can be written as t(i ,Ui) = V\ for some fixed i . Then we can 
define a loop in through 

Vi = t(i ,Ui) (18) 

whose vertices must satisfy 

v i+1 e C(vi), if % e l+, (19) 

v i+1 G P(vi), Hi El- (20) 

Since |/ + | ^ when we follow the loop around, the string of characters defining V\ has 
been shifted to the left \l + \ — \l-\ ^ times. But we have returned to v\ after going around 
the full loop; this implies that the bulk of the word v\ — c± • • • cn is invariant under shifting 
left by \l + \ — Specifically, 

Cj = Cj + A, where A = — i > n, i < N — n — A (21) 

But only K 2n+A nodes have this property (such nodes are completely defined by giving 
Ci • • • c n , Cjv-n-A+i • • • Cat), so we cannot have K N ~ M nodes with H(v) = U\ when > 
M + 2n + A. Thus, the assumption of an unstratifiable loop is incompatible with the 
assumption of a scalable tiling for T, so we conclude that if T can be used to construct a 
scalable tiling it cannot contain an unstratifiable loop, and therefore must be stratifiable. 
The theorem is thus proven. 

The condition of stratifiability is not by itself sufficient to guarantee that a tile gives rise to 
a scalable tiling. In some cases, stratifiable loops can form obstructions to the construction of 
a tiling. We now will proceed to prove that scalable tilings can be constructed on stratifiable 
tiles with certain types of loops. First, however, we need a few more definitions and a lemma. 

Lemma 2: A tile of size S = K M giving scalable tilings of and /C^ +1 for > M must 
be a stratifiable subgraph of Bft 

Proof: The tile must be stratifiable by Theorem 2. It must be a subgraph of B^ since it 
must tile B% with N = M. Note that since S = K M , the subgraph contains all vertices of 
Bki but will only contain a subset of the edges. 

We can thus associate with any scalable tiling of a de Bruijn or Kautz graph G = B 1 ^ or 
a map n : Vq — > V b m. For the tilings which we will construct, this map extends to a 
complete graph homomorphism II : G — » B^ . We define several properties for maps of this 
type 

Definition: A map n : G — ?■ B^ has the parent distribution property when 

n(p(«)) = P(n(«)), VueG (22) 
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Definition: A map II : G — > has the child distribution property when 

U(C(u)) = C(U(u)), Vw G G (23) 

Note in particular that if II : G — > B^ has both the parent and child distribution 
properties, then for any vertex u G G with H(u) = di ■ ■ ■ du G B^ , there is a unique parent 
Pu,x G P{u) of u with Yi{j> u ^ x ) = xdi ■ ■ ■ djif-i and a unique child c UjX G C{u) of u with 
n(c Uj2; ) = <i 2 ■ • • dM% for each x G Z^. 

We now prove by example that there exist maps II : G — > B^ with G = B^- or /C^ +1 
which have both the parent and child distribution properties, for any N > M 

For a de Bruijn graph B^, we begin by defining the kth discrete differentials. For each 
vertex v = C\ ■ ■ ■ cjy we define 

d?(«) = Cj 

= Cj +fe — Cj (mod K), i — 1, . . . , N — k, k > 0. (24) 
Let us fix iV > M and take k = N — M. Then the discrete differentials define a map 

cf : ci ■ ■ -cjv -> di ■ ■ -djv-fe, = df^ci ■ ■ ■ c N ) (25) 
which is a graph homomorphism 

d:B%^ B% (26) 

since if (c± • • • cn, c 2 • • • cn+i) is an edge of B^ then (d\ ■ ■ ■ d^-h, di • • • djv-fc+i) is an edge 
of Bft~ k . It is straightforward to check that this graph homomorphism has both the child 
and parent distribution properties. Note that using this homomorphism we can relabel the 
vertices of the graph G = B^ using the first k c's and the gPs 

Vg = {c\-~ c k di ■ ■ ■ d N _ k } . (27) 

Combining the discrete differentials on the de Bruijn graph B^ with the homomorphism 
([5]) gives us a map 

d:JC% +1 ^B™ (28) 

defined through d{u) = d(r(u)), with r defined as in ([5]). Again, it is easy to verify that 
this graph homomorphism has the child and parent distribution properties. Using this 
homomorphism, we can label the nodes in a Kautz graph H = /C^ +1 through 

V H = {s ci ■ ■ ■ c k dx ■ ■ ■ d N - k \s G Zk+i, ci, • • • , d N - k G Z K } . (29) 
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Definition: We define the height of a stratifiable loop L to be the difference a max — a m - m 
between the largest and smallest levels in a loop. 

We can now prove the following 

Theorem 3 Given a stratified tile T of size K which is a subgraph of Bf< (containing all 
vertices but not all edges of ), tilings of G = B^ and /C^ +1 can be constructed for all 
iV > M, as long as T contains no loops of height > M. Such a tiling can be constructed 
explicitly from any map IT : G — > Bf^ with the child and parent distribution properties. 

Proof: We showed above by explicit construction that a map IT : G — > B^ with the child 
and parent distribution properties exists for any M, for G a de Bruijn or Kautz graph of 
degree K with N > M. Let us take any particular such IT. From II we can define the tiling 
constructively as follows: choose a vertex x — d\ ■• ■ du of the tile T. Associated with this 
vertex of T there is a set of vertices in our de Bruijn or Kautz graph G which map to xq 
under IT. We can arbitrarily associate this set of vertices with the elements % in the index set 
/ to define t(i, Xq) for all i. For example, using the discrete differential map described above, 
given a specific Xq — d\ • ■ ■ 6m we can define the index set by the leading indices before the 
d's on this set of vertices in the notation of (12?] 129]) (i.e., ci • • • c& for de Bruijn, s Ci ■ ■ ■ c*. 
for Kautz). 

We have now defined t(i,Xo) for all i and for a specific Xq in Vp- Now, if there are any 
edges in T containing x (i.e. directed edges (xo,x') or (x',xo) beginning or ending on Xo), 
we can extend the definition. Say T contains the edge (xq,xi). For each i <E I, using the 
child distribution property, there is a unique vertex i>, in G which is a child of t(i, x ) and 
which has Tl(vi) = x%. We can thus define t(i,X\) = Vi for each i. These vertices Vi,Vj are 
distinct for i ^ j, by the parent distribution property — if V{ = Vj then t(i,x ) and t(j,x ) 
would both be parents of the same vertex with the same value of IT, which is impossible 
given the parent distribution property. Instead of extending along an edge where x is the 
parent, we could equivalently extend along an edge where x is the child, using an equivalent 
argument where the roles of parents and children are exchanged. We continue in this way 
for further edges containing either xq or X\. Each time we include a new edge we define the 
map t for a new value of x. If the graph T is disconnected, we run out of edges before the 
map t is completely constructed. In this case we choose a new vertex x' where t is not yet 
defined and proceed as above with x' in place of xq. In this fashion, we can construct the 
complete tiling. 

It remains to be shown that the construction of the tiling we have just presented is 
well-defined in that any loops existing in the tile do not lead to incompatible definitions 
for the tiling by following different paths to extend the definition of the tiling. This is 
straightforward to demonstrate for any loop whose height is less or equal to M. Given a 
loop L C T of height h < M we need to show that the loop can be consistently lifted to a loop 
in the graph G by following the edges and using the child and parent distribution properties 
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as in the above construction. Let us start at a vertex X\ G L such that a(y) > a{x\) G L. 
We lift X\ = di ■ ■ -(1m to a vertex u± G G with H(ui) = x\. Labeling the vertices of the 
loop in order as [xi, . . . ,x s , x s+ i = x\] where s is the length of the loop we can use the child 
and parent distribution properties to lift each xi to a vertex U{ G G as above, following the 
edges of the loop one after another. We need to show that when we return to x\ the final 
value of u s+ i is the same as u\. Assume that G = B^ — a similar argument proceeds for 
Kautz graphs. We can write U\ — c\ ■ ■ ■ c^. From the form of the de Bruijn graph edges, 
and since X\ has the lowest level of any vertex in the loop L, we see that all vertices Ui 
contain the coordinates c\ ■ ■ -cn-h, shifted to the right by a(xi) — a(xo) places. But then, 
u s+i = Ci • • • CN-hCN-h+i ■ ■ ■ Cat. Since by the child distribution property there is a unique 
Mth child of z\ ■ ■ ■ zmC\ ■ ■ ■ cn-m with II = Xi, we must have u s+ \ = u\ when h < M, so the 
loop of height h < M does not obstruct the construction of the tiling. This completes the 
proof of the theorem. 

Corollary: The tilings constructed by the method of Theorem 3 have the parallel routing 
property. 

Proof: This follows from the use of II with parent and child distribution properties, since 

IL(C(t(i, x))) = C{U{t{i, x))) = C(x) (30) 

is independent of i so (jSJ) holds for all i,i',x. The parallel routing property can also be seen 
to follow immediately from the fact that II is a graph homomorphism. 

We have now provided results which make it possible to systematically search for optimal 
solutions to Problem 3. We know that for tile size S = K , scalable tilings are given by 
stratifiable subgraphs of . We can thus consider the set of stratifiable subgraphs of £>|f , 
ordered by their number of edges. We know that all stratifiable subgraphs with no loops 
of height h > M can give scalable tilings. Thus, if a stratifiable subgraph of B^ can be 
realized with the maximal number of edges and h < M, this provides a solution to Problem 
3. In some cases, the stratifiable subgraph of B^ with the maximal number of edges may 
have a loop of height h > M. In such further check is needed to verify that such 

a subgraph gives a scalable tiling; generally this will not be possible, but it may be possible 
in special cases. In any case, finding a subgraph of B% with the maximal number of edges 
subject to the condition that there are no loops of height h > M will give rise to a good set 
of scalable tilings. 

We conclude this section with some further comments. 

First, we note that the discrete differential construction above is not the only way to 
realize a map IT : G — > B% with the parent and child distribution properties. Indeed, 
consider any function 

f : Z K x Z K Z K (31) 

with the following properties: 
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• for each c G Z K , f(c, •) gives a one-to-one map from — > Z K (i.e., for fixed c, /(c, c') 
takes different values for each d) 

• for each d G Z#, f(-,d) gives a one-to-one map from Z K — > Z K (i.e., for fixed d, 
f(c, d) takes different values for each c) 

If we replace d\ = Q + k — Cj(mod K) in ( l24l) with any other function d\ = f(ci + k, q) with 
these properties, we see that the associated map II : G — > still has the child and parent 
distribution properties when G is either a de Bruijn or Kautz graph. Using such a more 
general map in the construction of Theorem 3 provides a more general class of constructions 
of scalable tilings with the parallel routing property. 

Note however that not every graph homomorphism of the form II : G — > has the 
parent and child distribution properties. For example, while the map B^- — > B 3 K defined by 
taking a contiguous subset c a c a+ i ■ ■ ■ c a +j-i of characters in the string c\ ■ ■ ■ cn is a homo- 
morphism, it does not have these properties, and indeed cannot be used to define a tiling. 
To see this clearly in a specific simple case, consider the case K = 3, iV = 3. We cannot tile 
the 27-node de Bruijn graph £>f using tiles with vertices addressed by C2C3. For example, 
the 3 parents of the vertex u = 001 have coordinates c cic 2 = 000, 100, 200. These vertices 
all have the same value of C2C3 = 00, so the graph homomorphism taking cqC\C2 —> C1C2 does 
not have the parent distribution property. Assume now for example that our tile includes 
the edge (00, 01). The vertex 101, which should be associated with the vertex 01 on the tile 
has no parent with C2C3 = 00, so the edge (00, 01) cannot be contained in the tile. Similar 
problems arise with any other edge, so we cannot tile £>| with tiles having vertices addressed 
by C2C3 and more than edges. 

5 Tilings of generalized de Bruijn and Kautz graphs 

In this section we describe tilings of generalized de Bruijn and Kautz graphs. So far, we 
have discussed only de Bruijn and Kautz graphs of sizes K N and (K + 1)K N respectively. 
Generalizations of the de Bruijn and Kautz graphs to other sizes are described in [3j HI |5l [6] . 
We show here that the construction of the previous section gives tilings by tiles of size 
S = K M for generalized de Bruijn and Kautz graphs with V = nS vertices for any integer 
n. 

5.1 Generalized de Bruijn graphs 

A generalized de Bruijn graph of degree K having V vertices can be defined by taking the 
directed graph on vertices < i < V with edges 

(i,(Ki + m) mod V), V m, <m < K . (32) 

If V = K , it is easy to see that this definition agrees with the one in (|2]) by simply taking 
ci ■ • • cn to be the base K representation of the integer i for each vertex i. Multiplication by 
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K simply shifts the base K representation left by one digit, adding m shifts in an arbitrary 
new digit cn+i and modding by V = K N truncates to N digits base K. More generally we 
can write any V = nK M in the form 

V = FK N , (33) 

where M > N and F and K are relatively prime, i.e. (F,K) = 1. We can then represent 
any vertex i < V in mixed-base form as 

fc\ • ■ • c^r, Ci<K,f<F. (34) 

As before, the transformation rule associated with edges (1321) shifts the c's left. This trans- 
formation rule takes jc\ ■ ■ • cn — > f'c 2 ■ ■ ■ cn+i where 

/' = (Kf + c\) mod F . (35) 

Since K and F are relatively prime, multiplication by K is invertable mod F, so that for any 
fixed Ci the map defined in (|35l) from /—>•/' is a one-to-one map. In particular, given c 1; /' 
there exists a unique / < F such that (|35|) is satisfied. This demonstrates that the map from 
the generalized de Bruijn graph with V vertices to the regular de Bruijn with K N vertices 
given by dropping the first digit / in (JMj) has the parent distribution property fl22l) . This 
map also clearly has the child distribution property ( |23j) . which follows from the fact that 
the children of any node with mixed-base representation jc\ • • • cjy are just fc>i ■ ■ ■ Cfqvn with 
/' given by (!35]l and < m < K. Since this map has both the parent and child distribution 
properties, so does any map to a smaller de Bruijn formed by the composition of this map 
with a map II' : — > also having the parent and child distribution properties. This 
shows that the tiles which can be used to construct tilings of standard de Bruijn and Kautz 
graphs in Theorem 3 of the previous section can also be used to tile generalized de Bruijn 
graphs. We give some examples of such tilings in Section [6] 

5.2 Generalized Kautz graphs 

Similar to the generalized de Bruijn graphs defined through ( 132|) . a Kautz graph of degree 
K having V vertices can be defined by taking the directed graph on vertices < i < V with 
edges 

(i, (—1 — Ki — m) mod K), V m,0 <m <V . (36) 

To relate this to the standard Kautz graph in the case V = (K + 1)K , we again introduce 
a mixed mode representation for each vertex i, writing V = FK N with (K,F) = 1. We 
denote the digits in this mixed mode representation by 

/cic 2 c 3 c 4 • • ■ ° N , a,Cj < K, f < F (37) 
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where the last digit is if N is odd, and if N is even. The barred digits are defined to 
be Q = K — 1 — Q, giving a shift register representation of the vertices. From (13 6 j) we see 
that the graph edges are given by 

(/cica • • • ^ , /'c 2 c 3 • • • N m) (38) 

Cat Cat 

where 

/' = (-1 - Kf — ci) mod F (39) 
= (-#/ + ci - mod F . (40) 

In particular, if we choose F — K + 1, we have X = — l(mod F), so /' = (/ + 1 + ci)mod F, 
which is precisely the transformation rule on edges in ([6]) if we identify / = sq. Thus, when 
F = K + 1 we have the standard Kautz graph, while for other F we have a generalization. 

As in the previous subsection, the invertibility of multiplication by K mod F guarantees 
that the map to a K N node de Bruijn given by dropping the first digit / in the mixed 
mode representation ( 157)) has the parent and child distribution properties, and therefore 
the generalized Kautz graphs can also be tiled using the methods of the previous section. 
Examples are given in the following section. 



6 Examples 

In this section we give some explicit examples of tilings constructed using the discrete dif- 
ferential method of Theorem 3. 

Let us first consider tilings of de Bruijn graphs with K = 2 by tiles of size 4. The de 
Bruijn graph B\ is shown in Figure [TJ Up to graph isomorphisms there are 4 kinds of loops 
in this graph: 

[00->00], [00^01^10^00], [00^01^10^00], [00^01^11^10^00], 

(41) 

where the second and third loops differ only in the orientation of the link connecting 01 
and 10. All these loops are non-stratifiable. Thus, there are no stratifiable tiles given by 
subgraphs of B\ with more edges than the tile with 3 edges: 

Et — {(00, 01), (01, 11), (11, 10)}. (42) 

This tile therefore gives a scalable tiling which solves Problem 3 for F = 2,S' = 4 = 2 2 . As 
an example of a tiling using this tile, consider the tiling of B\ using the map II described 
using discrete differentials (T26)) . This tiling is depicted in Figure HI In this figure the numbers 
in brackets are the addresses c\d\di for each node as in ( )27j) . The colored/bold links are 
those realized on a tile T described by ( l4"2j) . 
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Figure 4: Optimal scalable tiling of the de Bruijn graph £>f by tiles of size 4 

Let us now consider simple examples of a Kautz graph tiling with K = 2. The optimal 
size 4 (M = 2) tile for K = 2 again has no loops and has only 3 edges. We can, for example 
choose the stratified subgraph of E\ containing edges Et = {(00, 01), (10, 01), (01, 11)}, with 
a (00) = er(10) = 2,a(01) = l,er(ll) = 0. We can use the discrete differential map (EHJ) 
d : K\ — > B\ to label the vertices of the Kautz graph K\ by s^d^di- The tiling associated 
with this map is depicted in Figure where nodes are labeled by s sis 2 (the nodes are, 
however, ordered according to the index % from (1361)). In this figure, again colored/bold links 
are realized on a tile T. The direction of each link is indicated by using dotted lines on the 
outgoing part of the link and solid lines on the incoming part of the link. 

For K = 2, up to relabeling there is only one function of the form fl3TT) with the desired 
properties, which is the one used in the definition ff24|) of discrete differentials. For K = 3, 
it is easy to check that up to relabeling of the integers, which is a symmetry of both the 
de Bruijn and Kautz graphs, there are precisely two distinct functions with the desired 
properties: 

/i(c, c') — c — c' (mod K) (43) 

and 

f 2 (c,c) = c + c (mod K) (44) 

Either of these functions can be used to construct tilings with K = 3. 

Now, for K = 3, consider the tile with M = 1. The de Bruijn graph here is just the 3 
vertices 0,1,2 with edges going from each vertex to each other vertex. The largest subgraph 
without directed loops has two edges, such as — > 1 — > 2. This defines the best tile for a 
scalable tiling. Now consider M = 2. A systematic analysis of possible stratifiable subgraphs 
of $3 shows that the maximum number of edges compatible with stratifiability is 11, which 
can be realized for example by the tile depicted in Figure El This tile has 4 loops of length 
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Figure 5: Optimal scalable tiling of the Kautz graph by 3 tiles of size 4. Direction of link goes from 
dotted end to solid end. 

4 (of which 3 are homotopically independent), all of which are of height 1. Tilings of the 
generalized de Bruijn and Kautz graphs with 18 nodes by two copies of this tile are shown in 
Figures [7] and [HI In these figures the node numbers are the numbers % used in the description 
of generalized de Bruijn and Kautz graphs in Section |5j 

We have carried out a systematic analysis of all tiles for small values of K, N. We have 
computed the optimal tiles by performing a complete search over all possibilities in all cases 
where K N < 16. Our results are tabulated here. 



K 


N 




max \Et\ 


2 


2 


4 


3 


2 


3 


8 


8 


3 


2 


9 


11 


2 


4 


16 


19 


4 


2 


16 


27 



Using a simple greedy-first algorithm we have found some good, but not necessarily 
optimal tiles for larger values of K N ; as examples, for K = N = 3 there is a tile with 44 
internal edges, and with K = 3, N = 4 there is a tile with 150 internal edges. In the next 
section we discuss tilings for arbitrarily large values of M 
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00 




20 




Figure 6: Optimal tile for K = 3, M = 2. Tile has 11 internal edges (shown) and 7 external edges (not 
shown) 




Figure 7: Optimal scalable tiling of the generalized de Bruijn graph with K = 3, V = 18 by tiles of size 9 
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Figure 8: Optimal scalable tiling of the generalized Kautz graph with K = 3, V = 18 by tiles of size 9 

7 Asymptotically efficient tilings 

In this section we provide an explicit construction of an efficient class of tilings. This class of 
tilings realizes the asymptotic bound ffl2|) on edges leaving the tile to within a factor of 2. As 
we discuss below, we believe that this class of tilings may in fact be asymptotically optimal, 
and that the theoretical bound may be improvable by a factor of 2. We cannot, however, 
demonstrate this conclusively at this time. Note that for the graph bisection problem there 
is a similar gap (in that case by a factor of 4) between upper and lower bounds on edge 
bisection bandwidth (TTJJ for general K, N [10] . 

7.1 Local score-based tiles 

We will now define some tiles explicitly by choosing a stratification of the nodes in a de 
Bruijn graph and applying Theorem 3. The basic idea we will use to construct efficient 
tilings is the identification of local patterns in the dd (discrete differential) address space to 
choose the levels in the stratification of the nodes of the tile of size K M . 

For each node of the de Bruijn graph £>|f labeled by d± ■ • • du-, we assign to each position 
in the node address a score between and 1 according to how well the node address in the 
vicinity of di matches some desired pattern. We will use the convention that a smaller score 
indicates a closer match to the desired pattern. The idea is that if this score is very low at 
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some particular position i for a node x = d\ • ■ ■ d,M, then the score will generally be low for 
position i — 1 in the node addresses of all the children of x. Thus, if we choose the level of 
the node x to be o~{x) = i where i is the position of lowest score, then generically children y 
of x will have lowest score at position i — 1, and will be assigned level a(y) = i — 1, so that 
the edge (x, y) can be included in the tile. 

There are many ways in which we can define such a local pattern-based score. To give a 
concrete example we define such a score in a way which leads to efficient tilings. 

Definition: We define a K-ary expansion score by interpreting the sequence of digits be- 
ginning with di as a base K expansion of a real number between and 1 in the following 

way 



where we define do = and d n — K—\ for n > M. Thus for example, the node x = 0010100 
for K = 2,M = 7 would have fa(x) = 0.00100l 2 = 0.00101 2 = 5/32. The idea is that the 
position in the node address describing the smallest real value will determine the level of the 
node. The leading digit is complemented base K so that the starting point of this lowest 
value is clearly marked; without this complementation the resulting tile would have multiple 
broken edges whenever a very small number (long sequence of 0's) initiates the node address 
sequence d\ ■ ■ ■ du- The boundary conditions are similarly chosen to minimize broken edges. 
With this score function, we then define a tile by defining a stratification of nodes through 



There are (limited) circumstances in which two positions i can have the same value of fa(x) 
for a fixed x, leading to "ties" where multiple values of i satisfy ( 146]) . Such ties only occur 
for nodes in which the last n digits are all equal to K — 1 and all previous digits are < K — 1. 
In this case we define 

<r((d a < K —!)••• {d M . n < K — l)(rf M -n+i = K -!)■■■ (d M — K — 1)) — M. (47) 

With this assignment of levels, each of these nodes has K — 1 children at level M — 1 (those 
with d^i 7^ K — 1) and one child at level M giving a broken edge. 

Example: In the tile with K = 2, M — 6, consider the node d\---d$ = 010010. The 
smallest score is found at the position where the smallest number appears base 2, with the 
first bit in the sequence beginning at that point complemented. This is position i = 2, with 
2 (O1OO1O) = 0.000101 2 = 0.00011 2 = 3/32. So this node is placed at level tr(010010) = 2. 
Similarly, each of the child nodes 100100 and 100101 have the smallest fa at i = 1 so 
a(100100) = a(100101) = 1, so edges (010010, 100100) and (010010, 100101) are both 
included in the tile. Note that for the node 100100, fa (100100) = 5/64, while 4 (1OO1OO) = 
1/8. 




Vi : < i < M 



(45) 



a(x) =i: fa(x)< fa(x) Wj ^ i, < j < M. 



(46) 
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Figure 9: A tile with K = 2, M = 4 based on the 2-ary expansion score stratification. Nodes with the 
smallest string beginning at position i (preceded by a 1) are placed on level 5 — i, with boundary conditions 
and tie-break conditions as discussed in text. Tile is optimal, having 19 internal edges (shown) and 13 
external edges (not shown). 



We can use this method based on the K-aiy expansion score to construct the full tile for 
any K,M. For each node in the tile the level of the stratification is defined through (j4"6"|) . 
with (]4"TI) used to break ties. 

Example: For the case of K = 2, M = 4, the K-axy expansion score stratification gives 
the tile shown in Figure This tile has 19 internal edges and 13 broken edges. Thus, this 
algorithm for tile construction yields an optimal tile in this case. 

It is straightforward to automate the construction of tiles according to this system. The 
numbers of internal (broken) edges for tiles with small values of K, M are listed in the 
following table. 



K\M 


2 


3 


4 


5 


6 


2 


3* (5) 


8* (8) 


19* (13) 


42 (22) 


90 (38) 


3 


10 (17) 


41 (40) 


146 (97) 


485 (244) 


1559 (628) 


4 


23 (41) 


129 (127) 


615 (409) 


2729 (1367) 


11697 (4687) 


5 


44 (81) 


314 (311) 


1876 (1249) 


10414 (5211) 


55794 (22331) 



In this table we denote with an asterisk (*) cases where the tile is known to be optimal. 
The 2 5 tile with 42 internal edges may also be optimal; we have not found a better tile using 
heuristic search methods. For most of the other small tiles, we have found slightly better 
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tiles than those constructed using this method with brute force or greedy algorithm searches, 
or by slight modifications of the score-based stratification algorithm described above; for 
example as mentioned above for the 3 4 tile we have found a tile with 150 internal edges. For 
larger tiles, we do not know of any general method for generating tiles with significantly more 
internal edges. As we now discuss, we expect that the asymptotic behavior of these tiles may 
be the optimal achievable. Note, however, that for larger values of K the approach to the 
asymptotic form is slower and the tiles produced by this method are somewhat sub-optimal. 
In particular, when K is large and M is small, many nodes have no digits di — K — 1, which 
leads to extra broken edges. This might be improved by some heuristic method combining 
such nodes into sub-tiles which are then attached to increase total edge numbers. But, as 
M — > oo, for any fixed K the fraction of such nodes goes to exponentially as \{K — 1)/K] M , 
so this issue does not affect the asymptotics. 

Note that in [9j a somewhat similar approach was taken to tiling degree 2 de Bruijn 
graphs B2 ' ■ In the language of score-based tiles used here, the tilings described in that paper 
can be defined by assigning a score of fa = when did i+1 = 01 and fa = I otherwise, with 
ties broken by choosing k to be the largest i with fa = 0. As described in [9], this gives a tile 
for M = 5 with = 32 internal edges (compared to 42 from the table above for the system 
defined here). The tiles described in j9] also do not have good asymptotic behavior, although 
as mentioned in that paper a generalization of their approach to match longer sequences can 
improve behavior in a fixed range of N. 

7.2 Asymptotics of tiles for local pattern-based scores 

We now analyze the asymptotic behavior of the number of broken edges in the K-axy ex- 
pansion score-based tiles for fixed K as M — > 00. We find that these tiles have the same 
asymptotic form for the number of broken edges as the lower bound ( |T2|) . multiplied by an 
overall factor of c = 2. This asymptotic behavior can be understood from a simple idealized 
model for the score-based tiles. We give a simple proof of the asymptotics of the number 
of broken edges in this idealized model, and then describe how the tiles just defined deviate 
from the idealized model and the consequences of these deviations for the asymptotic form 
of the number of broken edges. In Section [7.2.31 we numerically analyze the score-based tiles 
and compare to the theoretical asymptotics. 

7.2.1 Idealized model 

Consider an infinite random walk on the de Bruijn graph underlying a given tile. Such 
a random walk will on average traverse each edge an equal number of times, so the fraction 
of edges traversed which are broken will be equal to the fraction of broken edges on the tile. 
In an idealized model of the score-based tile system described in the previous subsection, 
we assume that each node of size M has scores fa which are uniformly and independently 
distributed random numbers between and 1. We assume that the scores are completely 
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local, so that a node with the score sequence fa - ■ ■ 4>m has children with score sequences 
02 • • • 0m0m+i for various values of 0m+i- In the idealized model we can associate the 
random walk on the de Bruijn graph underlying the tile with an infinite sequence of numbers 
X1X2 ■ ■ ', each chosen from a uniform random distribution on the set [0, 1). The values of Xi 
correspond to the sequence of scores 1 — fa on the nodes encountered in the random walk 
(while for fa we used the convention that the smallest determined the level of a node, we 
reverse this convention for the re's to simplify the computations below). In this idealization 
we ignore correlations between the values of fa at different points i, so each Xj is chosen 
independently from the uniform distribution. We associate each "node" in the random walk 
with a subsequence of M numbers Si = x i+ iX i+2 ■ ■ -x i+ M- In this model, there are no ties. 
An edge (xi ■ ■ x 2 ■ ■ -xmXm+i) is broken only if either x\ or xm+i is greater than all of 
X2, ■ ■ ■ ,xm- Heuristically, the chances of either occurrence are 1/M, giving a probability of 
broken edges of 2/M. 

To make this computation more precise, for each consecutive subsequence of M numbers 
Si = x i+ ix i+2 ■ ■ ■ x i+ M representing a node we assign a level k where x i+ k is the largest value 
in the subsequence. We thus have a sequence of values k(i) defining the positions of the 
largest re's in each subsequence Si. If a value x i+ k is the largest x in subsequence Si, and also 
is the largest x in S i+ i, then k(i + 1) = k(i) — 1. We say that the sequence has a "broken 
edge" at position % when k(i + 1) ^ k(i) — 1. We can now show 

Theorem 4 In this idealized model, as M — > oo the probability of a broken edge asymptot- 
ically approaches 2/M. 



Consider the set of "local maximum" x^s which are maximum values in some subsequence 
Sj in which they are contained. There is a broken edge precisely when the maximum x in 
one subsequence Si is replaced by a new maximum x' in the subsequence Si+i. Thus, the 
frequency of broken edges is the same as the frequency of local maxima. For a given number 
x we can determine the probability that it is the largest x in a window of size precisely m. For 
example, X{ = x is the largest in a window of size precisely 1 when Xi-i > x,Xi + \ > x. This 
occurs with probability (1 — x) 2 . a; is the largest in a window of size precisely 2 when either 
Xi-2 > x,Xi-\ < x,Xi + i > x or the symmetric condition with x > Xi + i but Xi-\,Xi + 2 > x. 
The probability that x is largest in a window of size 2 is therefore 2x(l —x) 2 . A similar set of 
conditions give m independent cases with probability x m_1 (l — x) 2 where x is the maximum 
in a window of size m. Integrating over all possible x, the probability that a given x will be 
largest in a window of precisely size m is 



Proof 
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the frequency of local maxima is then 



2 



f=} p(m) 



M + l 



(50) 



m=M 



This proves that in the idealized model the frequency of broken edges is 2/(M + 1) « 
2/M + C(l/M 2 ). 

This result in the idealized model suggests that the smallest fraction of broken edges 
we can achieve in a tile based on local patterns in the node address is asymptotically 2/M. 
Now let us consider the relevant differences between the idealized model and the K-ary 
score-based tiles defined in the previous subsection. 

7.2.2 Deviations from idealized model 

There are three primary ways in which the score-based tiles we have defined deviate from 
the idealized model. First, there are some situations in which different positions have the 
same score fa = <pj for a fixed node. Second, the scores are not completely independent. 
Third, the scores fa depend on multiple local symbols dk, and therefore in particular the 
score associated with a point i in a node does not necessarily stay invariant in the children 
of that node — we have fa_i(v) ^ <fii{u) along an edge (u, v) whenever the final symbol in v 
is not K — 1 . 

Let us treat these deviations from the idealized model in order. First, consider the 
situation of ties. Generally, ties will increase the frequency of broken edges. For example, 
consider the variation on the above model where Xi is chosen randomly from a uniform 
distribution on the discrete set {k/D, < k < D}, and ties are broken by choosing k{i) to 
be the smallest k with the maximum value of Xi + j~ in the set Si. Then when M 3> D, the 
frequency of broken edges will be 1/D ^> 1/M. (The approach taken in [9] is an extreme 
example of this; their rule for constructing tiles can be formulated in the language of scores 
as described above by taking a score which is either fa = if didi+i = 01 or fa = 1 otherwise. 
This gives a frequency of broken edges of 1/4 for large M.) 

As noted above, for the K-axy score based tiles defined in I7.1[ ties only occur for nodes 
with d\, . . . , d n -i < K — 1, d n = ■ ■ ■ du — K — 1. The total number of nodes where such ties 
occur is 



The fraction of nodes where ties happen therefore decreases exponentially in M as [(K — 
1)/K] M 1/M when M — y oo. The consequences of ties are therefore negligible in the 
asymptotic behavior of the number of broken edges. 

Next, let us consider the other two issues: the lack of independence between nearby scores 
and the non-invariance of scores when edges are followed. Each of these effects arises from 
dependence of the score fa on all the symbols d i+ k for any k > 0. From the form of ( T4"5l) we 



iV ties = 1 + (K - l) 1 + (K - l) 2 + ■ • • (K - 1) 



M-l 



(K- 1) M - 1 
K-2 



(51) 
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see that the effect of d i+ k on the score 0; is suppressed exponentially as 1/K k . For a node in 
a tile of size M, the average spacing between the scores of the M different positions i is 1/M. 
Thus, the impact of di+k becomes negligible compared to this spacing when k 3> log^- M. 
Another way to see this is to note that the smallest will arise from the longest sequence 
of O's in the node which is preceded by the symbol K — 1. As M — > oo, the average number 
of sequences of n zeros in a random node address goes as M/K n . Thus, we expect 0(1) 
sequences of log K M O's but 0{1/M n ) sequences of n + 1 times this many O's. So again, the 
number of relevant digits in determining the minimum <ft is of order 0(log K M). The relevant 
correlation distance between 0's is thus also log^ M. Similarly, when a new digit sequence 
giving the lowest is being shifted in from the right, asymptotically of order 0(log K M) 
digits must be shifted in to realize the smaller value of 0. 

The upshot of this analysis is that there are corrections to the asymptotic form of the 
idealized model of order (log^ M)/M when we consider the score-based tiles described above. 
The most important of these effects is the delay by log K M digits in shifting in a new score 
0j. When M — i < log K M, 0j will be higher than the appropriately shifted value in any 
of the descendants of a given node due to the boundary condition d n — K — 1 for n > M 
described below fj4"5]) . This means that the effective size of the window associated with the 
tile is really M — 0(\og K M), which contributes a correction term of order (log^ M) /M to the 
asymptotic form of the fraction of broken edges 2/M for the idealized model demonstrated 
in Theorem 4. 

Because all of these effects are either exponentially suppressed in M or suppressed by 
a factor of (log K M)/M relative to the leading 2/M, none of these effects will modify the 
asymptotic form of the number of broken edges for the score-based tiles described above. 
We now verify this analysis with numerical computation. 

7.2.3 Numerical verification of asymptotics 

To verify the asymptotic analysis performed in the previous subsection, and the validity of 
the approximations made in going from the score-based tiling system to the idealized model, 
we have done a numerical analysis of the number of broken edges for tiles with small K 
and reasonably large M. This data corresponds closely with the theoretical analysis just 
performed. 

In Figure [Hi] we have graphed the logarithm of the number of broken edges in the 2- 
ary expansion score-based tiles for K = 2, M < 20. The number of broken edges in these 
tiles matches very closely to the theoretically estimated fraction of 2/M, and lies above the 
asymptotic form of the minimum fraction 1/M. Note that for M > 11, the number of broken 
edges exceeds the fraction 2/M of the total number of edges. This is compatible, however, 
with the expected (log^ M)/M form of the corrections to the asymptotic form computed 
above. For M > 11 the computed number of broken edges lies between the asymptotic form 
2/M and the log-corrected asymptotic form 2/(M + 1 — (log^ M)/2). 

For tiles with K = 3 we have a similar close match to the asymptotic fraction 2/M of 
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Figure 10: (Logarithm of) number of broken edges for 2-ary expansion score based tile, compared to 
theoretical asymptotic fraction 2/M and asymptotic lower bound 1/M. 

broken edges, as depicted in Figure [TTJ 

To summarize, from our analysis of an idealized model, we expect that a fraction of 
2/M broken edges is asymptotically the optimum which can be achieved based on a local 
pattern for level determination. We have constructed a family of tiles which realize this 
asymptotic form, and are thus asymptotically optimal for local pattern-based tile structures. 
This asymptotic result differs from the lower bound computed in Section [3] by a factor of 2. 
We suspect that it is not possible to find tilings with an asymptotic fraction of less than 2/M 
broken edges, based on our asymptotic analysis and brute force and greedy algorithm analysis 
of small tiles, but we have not proven this conclusively. Any mechanism for constructing 
tiles with an asymptotic fraction of c/M broken edges with c < 2 would require a means of 
determining the level of each node based on global properties of the dd node address, like 
some kind of global hashing function, rather than a determination of level based on local 
patterns such as we have considered here. 

8 Application to supercomputers 

The results of this paper give not only a theoretical understanding of how de Bruijn and 
Kautz graphs can be decomposed into isomorphic subgraph tiles, but also a concrete ap- 
proach to constructing such tilings. Computer systems containing thousands of individual 
processing elements need efficient communication networks to minimize overhead in passing 
data between the processors. Because of their high degree of connectivity, de Bruijn and 
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Figure 11: (Logarithm of) number of broken edges for 3-ary expansion score based tile, compared to 
theoretical asymptotic fraction 2/M and asymptotic lower bound 1/M. 

Kautz graphs are very well suited to such large scale processing networks. The practical 
problem of wiring together many processors in such a network is substantially simplified by 
the approach of combining multiple processing units into tiles with isomorphic wiring, and 
then combining the tiles as we have described in this paper. 

In principle, the methods described in this paper can be used to design wiring systems for 
computing systems at a range of scales. For any given size of system, the tradeoff between 
communication and processing power will affect the choice of degree K, and practical design 
considerations will affect the choice of tile size K . The results described here should be 
useful in determining the complexity of wiring needed for any such system design. In partic- 
ular, the lower bound on edges connecting tiles derived in Section [3] gives an absolute lower 
limit on the complexity of wiring necessary for such a system. The explicit tile constructions 
given here give concrete examples of wiring patterns which can be used for such systems. 

The feasibility of a practical implementation of the tiling methods developed in this paper 
is demonstrated by the Sicortex, Inc. family of cluster computer systems [12] . The internal 
communication network in these systems is based on a degree K = 3 Kautz digraph. Circuit 
boards contain processor notes connected in a stratified subgraph of a degree 3, diameter 3 
de Bruijn digraph, so that a full Kautz graph of any desired size can be wired by connecting 
identical boards as described in Theorem 3, taking advantage of the parallel routing property. 
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9 Conclusions 



In this paper we have developed a systematic approach to partitioning de Bruijn and Kautz 
graphs into isomorphic subgraph "tiles" connected by a minimal number of additional edges. 
These results utilize the common mathematical structure underlying de Bruijn and Kautz 
graphs, and shed light on the structure of these graphs and their generalizations. The tilings 
we have constructed here have practical application to the construction of massively parallel 
computer systems. 

We have given necessary and sufficient conditions for constructing efficient tilings, we 
have characterized optimal tiles in terms of de Bruijn graphs of the size of the desired tile, 
and we have constructed an asymptotically optimal class of tilings. We have not, however, 
found a general method for explicitly constructing provably optimal tiles of arbitrary size, 
nor have we found a general formula for the number of internal edges achievable by an 
optimal tile of arbitrary size. We leave these open problems for future work. 
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